Text independent speaker recognition using micro-prosody

نویسندگان

Youn-Jeong Kyung

Hwang-Soo Lee

چکیده

The acoustic aspects that differentiate voices are difficult to separate from signal traits that reflect the identity of the sounds. There are two sources of variation among speakers: (1) differences in vocal cords and vocal tract shape, and (2) differences in speaking style. The latter includes variations in both target vocal tract positions for phonemes and dynamic aspects of speech, such as speaking rate. However, most parameters and features are in the former. In this paper, we propose the use of a prosodic feature that represents micro prosody of utterances. The robustness of the prosodic feature on noise environment becomes clear. Also we propose a combined model. The combined model uses both the spectral feature and the prosodic feature. In our experiments, this model provides robust speaker recognition in noise environments.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MIMIC : a voice-adaptive phonetic-tree speech synthesiser

This paper presents Mimic : a decision-tree based concatenative voice adaptive text to speech synthesiser. Mimic integrates text to speech synthesis (TTS) with speech recognition and speaker adaptation. Speech is synthesised from concatenation of triphone synthesis units. The triphone units are obtained from clusters of training examples modelled, labelled and segmented using clustered HMMs and...

متن کامل

Prosodic features based on wavelet analysis for speaker verification

Most conventional speaker recognition systems rely on short-term spectral information. But they ignore the long-term information such as prosody which also conveys speaker information. In this paper, we propose an approach that extracts prosodic features based on long-term information. First, by making wavelet analysis, we can reveal the trends of the f0 and energy contour. Subsequently, the pr...

متن کامل

Using Voice Quality Features to Improve Short-Utterance, Text-Independent Speaker Verification Systems

Due to within-speaker variability in phonetic content and/or speaking style, the performance of automatic speaker verification (ASV) systems degrades especially when the enrollment and test utterances are short. This study examines how different types of variability influence performance of ASV systems. Speech samples (< 2 sec) from the UCLA Speaker Variability Database containing 5 different r...

متن کامل

Speaker Independent Speech Recognition Using Hidden Markov Models for Persian Isolated Words

متن کامل

Speaker Independent Speech Recognition Using Hidden Markov Models for Persian Isolated Words

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1998

Text independent speaker recognition using micro-prosody

نویسندگان

چکیده

منابع مشابه

MIMIC : a voice-adaptive phonetic-tree speech synthesiser

Prosodic features based on wavelet analysis for speaker verification

Using Voice Quality Features to Improve Short-Utterance, Text-Independent Speaker Verification Systems

Speaker Independent Speech Recognition Using Hidden Markov Models for Persian Isolated Words

Speaker Independent Speech Recognition Using Hidden Markov Models for Persian Isolated Words

عنوان ژورنال:

اشتراک گذاری